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A FACTOR ANALYSIS OF VERBAL ABILITY 


By Donato M. JoHNson and Fioyp ReyNoLps 
Fort Hays Kansas State College 


The investigation reported here was undertaken because of the 
writers’ interest in the processes underlying problem-solving. Fa- 
miliarity with the experimental literature on the higher mental 
processes, such as Woodworth has summarized in the last two chap- 
ters of his recent work (16), suggests that there may be two funda- 
mental processes involved in solving all problems: (1) calling up 
various acts or responses, and (2) selecting those responses which 
meet the requirements of the problem. 

Comparative psychology teaches that in the first process, call- 
ing up responses, the higher, more intelligent species show more 
variability. They have a larger repertoire of responses. Munn 
states, for example, that “stereotypy decreases and general plasticity 
of response increases as the higher forms are approached” (7, p. 
100). Clinical psychology shows that the bright use more different 
methods of performing than the dull. Experiments on human 
adults give similar results and suggest that individual differences 
in this process may be an important factor in solving problems. 
Let us call this hypothetical factor F, for flow of responses. 

But problems are not solved by aimless activity alone. The 
problem-solving experiments, both animal and human, show restric- 
tion of the responses, under the influence of some sort of direc- 
tive tendency, to the more likely places, objects or ideas. Complete 
solution, including transfer, implies recognition of the fit of cer- 
tain responses to the requirements of the problem. Let us assume 
that individual differences in this process may be another important 
factor in solving problems. Let us call this hypothetical factor S, 
for selection of responses. 

It would be difficult to set up an experiment to isolate either 
of these processes. In operation each is complicated by the other. 
Individual differences in the two processes may be studied, how- 
ever, by factor analysis. It seemed likely that, if a suitable bat- 
tery of tests could be given, factor analysis of the scores would 
yield useful information as to the presence and importance of these 
processes. 

We do not know, of course, whether flow of one kind of 
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responses, e.g. verbal, is related to flow of another kind, e.g. nu- 
merical. Hence, we have confined our investigation to verbal tasks. 
In statistical analyses of intellectual ability the verbal factor has 
consistently appeared as a factor of considerable importance. (See 
1, 2, 4,6 and 13.) Any information regarding the processes involved 
in solving verbal problems would, therefore, advance our knowl- 
edge of intellectual ability. Our hypothesis is that factor analysis 
of a number of verbal tests will disclose Factor F, flow of responses, 
and Factor S, selection of responses, in significant amounts. 


Tue Tests 


The selection of tests for factor analysis is an important mat- 
ter. For present purposes the range of verbal ability should be 
covered. There should be some tests which could be presumed 
to include F and S in varying amounts. Yet some of the tests 
should be so simple that the resulting statistical factors—whatever 
they might turn out to be—could be interpreted without ambiguity. 
Ten tests were used, five of which were standard tests, and five 
constructed ad hoc. 


1. Flow. This test was intended to measure F, or flow of 
responses, in as pure a form as possible. It was suggested by a 
test on the ten-year level of the old Stanford-Binet (11). The 
directions were to write as many words as possible in two min- 
utes. 


2. Multiple Completion. This test was similar to Flow but 
required some selection or fitting of the responses. Sample: 


EE ae ee eae 


Each blank was to be completed by a word which, if used alone, 
would complete the sense of the sentence. Twelve minutes. 


3. Paired Opposites. Instructions were given to name as many 
words and their opposites as possible in four minutes. This test 
was expected to depend largely upon flow of responses. Some 
degree of restriction is evident, however. 


4. Word Cross-outs. The subjects crossed out irrelevant words 
that had been inserted in the text of an article taken from a 
popular magazine. For example: 
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Midget auto racing has developed servile a handicapping system 
based on the... 


Servile is to crossed out. The content of this test could not be 
estimated in advance. It was included for the sake of variety. 
Three minutes. 


5. Same and Opposite. This test was taken from the 1931 
A. C. E. examination (15). There are four words on a line, 
two of which are either the same or opposite in meaning. The 
subjects were to identify the two words. Five minutes. Sample: 


1. obtuse, 2. aromatic, 3. acute, 4. occult. 


6. Verbal Relations. Two types of verbal problem were used 
in this test. Samples: 


fo) Piast ............ meals 
(b) Irksome is to wearisome as engrossed is to ...........- 
1. enhanced, 2. occupied, 3. apathy, 4. dilapidated. 


The problem in (a) was to find a word similar in meaning to 
both plank and meals, e.g. board. This type of item was sug- 
gested in a recent report by Shipley (10). The (b) items were 
of the usual analogy type. Fifteen minutes. 


7. Vocabulary. The vocabulary section of the Nelson-Denny 
Reading Test (8) was used to get vocabulary scores. It con- 
tains five-choice items, such as: 


To feign is to 1. fret, 2. faint, 3. molest, 4. pretend, 5. portend. 
Ten minutes. 


8. Restricted Completion. This test was the completion test 
of the 1931 A. C. E. examination (15). Sample: 


A dead body, especially a human one, is sometimes called a 
are 


The number in parenthesis indicates the number of letters in 
the most appropriate word. It is here called a “restricted com- 
pletion” test because the response is restricted in two ways: mean- 
ing and length. In Test 2, Multiple Completion, there is much 
less restriction or fitting. Eight minutes. 
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Factor analysis of these eight tests could be expected to disclose 
some information about individual differences in verbal ability. If 
a small number of factors should be found, it would be interesting 
to investigate their involvement in a practically important verbal 
skill, such as reading. Therefore a reading test was included. An 
intelligence test was included because, the items on intelligence tests 
are usually minor problems to be solved, and because intelligence- 
test scores are usually closely related to verbal proficiency. 


9. Intelligence. The Henmon-Nelson Test of Mental Ability, 
Form A, for college students was used (5). This is a well- 
known multiple-choice test. Approximately half of the items 
seem to be verbal. Thirty minutes. 


10. Reading. The Comprehension section of the Nelson- 
Denny Reading Test, Form A, was used (8). The subjects 
read paragraphs and answer multiple-choice questions about 
them. Twenty minutes. 


These tests were all given with time limits. The times, except 
for the standard tests, were selected after a little preliminary ex- 
perimentation. When the reliability coefficients are presented, it 
will be evident that some of the tests should have been longer. 


SuBJECTS 


The tests were given during three class hours to 120 summer- 
school students at Fort Hays Kansas State College. The group was 
somewhat heterogeneous in respect to age since some of the subjects 
were teachers who had been teaching a few years. The records of 
113 subjects were complete and constitute the data for the factor 
analysis. 


STATISTICAL TREATMENT 


The ten tests were scored, and means and SD’s for the ten dis- 
tributions were computed in the usual way. These are presented in 
Table I. Table I also shows the reliabilities of each test. For the 
standard tests the reliability reported by the author was used. In the 
case of the Nelson-Denny Reading Test (8), which consists of two 
parts, Vocabulary and Reading, the reliability of the total score only 
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TABLE I 


MEANS, DISPERSIONS AND RELIABILITIES FOR 113 
COLLEGE STUDENTS ON TEN VERBAL TESTS 


Reliability 

Test Mean SD Coefficient 
1. Flow 54 8.3 603 
2. Multiple Completion 137 34.4 885 
3. Paired Opposites ae 4.8 405 
4. Word Cross-outs mm 7.4 844 
5. Same and Opposite il 6.2 87 
6. Verbal Relations - 16 4.3 785 
7. Vocabulary F : 43 15.2 842 
8. Restricted Completion 18 6.5 81 
9. Intelligence : 44 10.6 89 
10. Reading 20 6.3 842 


is given. The assumption was made that both parts are of approxi- 
mately equal reliability, and the Spearman-Brown “prophecy” 
formula was used to estimate the reliability of each part. For the 
newly constructed tests reliability was computed by the split-half 
method with the aid of the Spearman-Brown formula. Since two 
kinds of items were used in Test 6, items 1, 2, 5, 6, 9, 10... were 
correlated against items 3, 4, 7, 8, 11, 12 .... In Tests 1, 2, 3 
and 4, where the score was simply the number of correct words, 
the subjects were stopped at half-time and asked to draw a line. 
Then all were started again at the same place. First and last halves 
were correlated to get reliability coefficients. Some of these reliabil- 
ity coefficients are probably not highly accurate but they are ac- 


TABLE II 


CORRELATION MATRIX FOR TEN VERBAL TESTS 
Raw Correlations Below the Diagonal; Corrected Above 


l 2 3 4 5 6 7 8 9 10 
455 373 38 27 ZF A48 S2 133 1 
2  =«.303 a 2 Fe @ 2 te SS 
5 2 «CAG 387 «8317 6567) 611 =A 153) = 288 
4‘ 22 19 i 325 650 577 =.452)— 750695 
> 20 205.37 £2 811 829 659 .730 .686 
6 .189 .158 .320 .529  .670 688 .754 821 .642 
7 129 171) =.240))S 486.710 559 548 .665  .717 
8 221 102 2 31 333 el ASS 562 = .505 
9 099 .269 092 .650 .642 .686 576 .477 .764 
10.132) 340.139) 586.587) 603 S604 = 417—s«662 
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curate enough for present purposes. Test 3 has very low reliabil- 
ity and does not contribute much to the present investigation. The 
time for Test 3 was only 4 minutes. 

The scores were next transmuted into sigma scores and the inter- 
correlations among the ten tests computed. Raw and corrected 
correlations are presented in Table II. Some of the correlations in 
Table II are rather high and some are almost zero. None is nega- 
tive. 

Thurstone’s centroid method of factor analysis (12) was used 
in the present study. Its convenience and utility have been pointed 
out by Garrett (2) and by Guilford (4) among others. Guilford’s 
outline (3) was followed. The analysis was carried through with 
raw r’s and with corrected r’s. Since the variance due to unreliabil- 
ity varied considerably from test to test, it seemed to the writers 
that the factor loadings resulting from the use of corrected r’s could 
be more clearly interpreted. Roff (9) has shown that both correla- 
tion matrices have the same rank. The analysis was carried through 
twice. At the close of the second analysis the communalities were 
quite similar to the communalities used in computing the first fac- 
tor loadings. 

After two factors had been extracted, the distribution of residual 
correlations had a PE of .056. The PE of a zero r when N = 113 
would be .064. We are not justified, therefore, in extracting a 
third factor, especially since only ten tests were used. The first 
factor pattern is shown in Table III and plotted in Figure 1. 


TABLE III 
FIRST FACTOR PATTERN 


Factor weights 


Test I II h? 
l 303 —.476 319 
2 361 —.355 .256 
3 380 —.173 .174 
4 .697 —.080 .492 
5 .810 .232 710 
6 801 .202 .682 
7 743 .265 .622 
8 .619 .281 462 
9 Pe .183 .634 

10 758 —.161 .601 
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FIGURE 1 


Plot of ten test vectors in reference to the centroid axes I and II. 
After rotation the axes are called I’ and II’. 
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In rotation only one principle was followed: the principle of 
positive manifold. Rotation 60° clockwise suffices to render all 
factor loadings positive. It can be seen from Figure 1 that some 
leeway is permitted in rotating. Rotation of a little more than 60 , 
or a little less, would have given a positive manifold. We have no 
grounds for assuming that either Test 1 or Test 8 measures a 
factor in purer form than the other. It seemed best, therefore, 
to place the reference axes approximately the same distance from 
each. The axes after rotation are called I’ and II’. 


TABLE IV 
FINAL FACTOR PATTERN AFTER ROTATION 


Factor weights 


I’ Il’ h? 
Test (flow) (selection) 

1. Flow , 564 .024 BS 
2. Multiple Completion .488 135 .256 
3. Paired Opposites 340 .246 .174 
4. Word Cross-outs —— 564 .492 
5. Same and Opposite . .204 818 .710 
6. Verbal Relations .226 795 682 
7. Vocabulary 143 .776 622 
8. Restricted Completion .067 677 .462 
9. Intelligence .230 .763 634 
10. Reading 516 575 601 


In interpreting the rotated factor pattern one usually looks for 
similarities of material among the tests with high loadings in a 
factor. In the present problem the material is presumably of one 
kind and we therefore look for similarities in ways of handling 
the material, i.e. processes or functions. Test 9, Intelligence, and 
Test 10, Reading, are relatively complex and are not immediately 
helpful in the interpretation. 

If we look at the loadings in I’ in the light of the hypothesis 
outlined in our introduction, it becomes easy to identify Factor 
I’ as our hypothetical Factor F. The tests with the largest load- 
ings in this factor are those which are most dependent on flow of 
responses. Going from Test 1 to Test 8 we see that, as the free 
flow of responses is restricted, the factor loadings approach zero. 

By similar reasoning Factor II’ can be identified as our hy- 
pothetical Factor S. The factor loadings do increase as one goes 
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from the tests involving free flow of words to those involving the 
selection of the words which fit the requirements of the problem. 
The interpretation of the factor pattern in accordance with our 
hypothesis seems quite clear-cut. 

A secondary purpose of the investigation was to determine to 
what extent Factors F and S are involved in general intelligence 
and in reading. Table IV shows that the Henmon-Nelson intelli- 
gence test correlates .230 with Factor F, flow of responses, and 
.763 with Factor S, selection of responses. One would expect S 
to be a greater factor in intelligence tests than F. It is an interest- 
ing finding, nevertheless, in view of the fact that the test is a timed 
test. This test has fairly high correlations with other group intelli- 
gence tests and may be taken as typical. 

Factors F and S seem to be about equally involved in the read- 
ing scores. The reading test yields one of the highest loadings in 
Factor F. It is well known that scores on tests of intelligence, vo- 
cabulary and reading are rather closely related. The correlations in 
Table II demonstrate these relationships again. Table IV, how- 
ever, gives new information about similarities and differences in 
the processes operating in these three test situations. 


Discussion 


This is not the place for a critical discussion of factor analysis. 
Fortunately, several of the controversial issues in factor analysis 
are evaded by the simplicity of the present experimental design. 
For example, the rotation which is necessary to clarify the factor 
pattern is obvious from inspection of Figure 1. There would prob- 
ably be no difference of opinion on this point. As to the inde- 
pendence of the factors there would be little dispute since Tests 
1 and 8, which best define the two factors, show a corrected corre- 


lation of only .030. (See Table IT.) 


It should be noted that some of the communalities are not very 
large. Extracting two factors leaves much of the variance un- 
accounted for. Speed of writing might account for some of the 
variance in the present analysis. A more complete factor analysis, 
determining the factorial composition of verbal tests in more detail, 
would use a larger number of tests—and might disclose other im- 
portant processes. Tests of verbal association and of attention 
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span for words and concepts should be included. Our purpose in 
the present investigation was not to do a complete factor analysis, 
but to determine the importance of two processes which had been 
postulated by experimental rather than factorial methods. In spite 
of the small scale of the present study and the inexactnesses which 
have been pointed out above, the analysis indicates that the two 
processes considered here, calling up words, and selecting those 
which meet the requirements of the problem, are involved in solving 
problems of a verbal nature. The effects of individual differences 
in each of these processes on the scores in each test is indicated 
by the factor weights, but the writers would not claim much ac- 
curacy for the factor weights because of their dependence upon the 
selection of tests, selection of subjects, manner of scoring, et cetera. 
The relative position of the tests in terms of their loadings with 
each factor is probably fairly accurate. 

Materials versus processes. Statistical analyses of intellectual 
ability have emphasized materials rather than processes or opera- 
tions. (See 1, pp. 313-315, and 4, pp. 375-382.) This may not 
be true in the future. It may be that similarities in materials are 
easier to see than similarities in processes, and thus attract atten- 
tion first. When the same material is used in all tests, as in the 
present study, processes are, of course, emphasized. If future fac- 
tor analyses disclose other processes as important factors in prob- 
lem-solving, one would expect to hear psychological terms, such as 
“grasp” and “elaboration of alternatives,” rather than logical terms, 
such as “generalization,” “analogy” and “deduction.” 

An interesting task for the future would be to determine whether 
the same or similar processes are involved in solving other kinds 
of problem, e.g. numerical. A priori, one might expect some simi- 
larity in fitting responses to a verbal problem and to a numerical 
problem, but not in flow of verbal and of numerical responses. 
Probably S is a more general factor than F. 

Note that Tests 5, Same and Opposite, 6, Verbal Relations, and 
9, Intelligence, in which the items are minor problems and the 
responses are highly restricted both formally and functionally, have 
factor weights in F which appear to be significant. This suggests 
that flow of relations, concepts, schemas (16, p. 798), hypotheses, 
insights, plans of action, et cetera, may be included in Factor F. 
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These are probably all verbal for most people. One biological 
process could conceivably yield the most logically diverse intel- 
lectual products. ; 


Is verbal ability unitary? Tests 5, 6, 7, 8 and 9 are of the kind 
commonly used in factor analysis. Figure 1 shows that verbal abil- 
ity is practically unitary as far as these five tests are concerned. 
The question is whether the other five tests can be ignored in a 
general description of mental ability. Test 10 is a reading test, 
with approximately equal weights in both factors. Reading is, of 
course, an important verbal skill; it cannot be ignored. The other 
four tests are novel tests, constructed to test a particular hypothesis. 
But they are certainly verbal tests. It is the writers’ opinion that, 
if verbal ability is considered as the capacity to use verbal material 
in the solution of verbal problems, at least two dimensions are 
necessary for its factorial description. 


Are other interpretations possible? Since the factor pattern was 
interpreted and the factors named in the light of a particular 
hypothesis, it is desirable to look for alternative interpretations. 
Reconsideration of Table IV suggests that Factor I’ could be called 
a speed factor and Factor II’ a precision factor. But these are 
probably merely alternative names for the factors we have previous- 
ly called by more descriptive names. I’ cannot be an unqualified 
speed factor because all tests were timed. It must be speed of a 
sort which is especially needed on Tests 1, 2, 4 and 10. 


Professor Thurstone (14) has recently discussed two verbal fac- 
tors discovered by centroid analysis. V is the well-known verbal 
factor, and W, a word-fluency factor, “is revealed by readiness in 
producing words to fit a specified set” (p. 200). The similarity 
of Thurstone’s description of W and our description of F is strik- 
ing. It is possible that our S is the usual V or verbal factor 
while F is the newer W or word-fluency factor. If they are the 
same, their independent discovery is an argument for the validity 


of the method. 


Thurstone adds that “the recognition form of an opposites test, 
for example, should require little or no W, whereas a free-writing 
form of the same test should have high saturation in W” (p. 201). 
This statement holds only superficially for our F. Test 8, which 
has the least F, is formally a free-writing completion test. Psy- 
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chologically, however, the flow of responses is highly restricted. 
Here is an opportunity for confusion between formal and func- 
tional characteristics of a task. V and W are significantly corre- 
lated (p. 201) while F and S are not. If a wider variety of tests 
had been included in the present study, would F and S have been 
correlated? 

A methodological note should be added. This investigation was 
planned as a test of a particular hypothesis. Then the factors were 
named in the light of this hypothesis. Such a procedure introduces 
the possibility of bias in selecting the tests and in interpreting the 
factor pattern, a possibility which is greater with factor analysis 
than with more objective statistical tools. Nevertheless, the bene- 
fits of a hypothesis in planning the investigation and selecting the 
tests far outweigh the dangers of misinterpretation. Thurstone (14) 
points out the advantages of developing a hypothesis from a fac- 
tor analysis, then testing this hypothesis by a subsequent factor 
analysis. The writers would add that the use of factorial methods 
to test a hypothesis developed by different methods—in the present 
case, by experimental methods—has its advantages also. 


SUMMARY 


The literature on problem-solving among animal and human 
subjects suggests that there may be two fundamental processes 
involved in solving all problems: (F) the flow of various acts or 
responses, and (S) the selection of these responses according to 
the requirements of the problem. This leads to the hypothesis that 
individual differences in these processes would be important in 
determining scores on tests which include the solution of problems. 

The investigation was limited to the solution of verbal problems. 
Factor analysis (centroid method) of ten verbal tests disclosed two 
independent factots which on rotation can be identified as the 
hypothetical Factors F and S. The importance of these factors in 
such common tests as vocabulary, intelligence and reading is shown 
by the factor weights. It seems quite clear-cut that the two processes 
or functions suggested by the experimental literature are important 
in solving verbal problems, and that individual differences in F, 
flow of responses, and S, selection of responses, are responsible 
for a large share of the variance in scores on verbal tests. Relations 
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between the findings of this investigation and problem-solving in 
general are discussed. 


Fort Hays Kansas State College 
Hays, Kansas 
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